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Abstract 



We further develop the constrained mass variable techniques to determine the mass scale of invisible 
particles pair-produced at hadron colliders. We introduce the constrained mass variable Mzc which 
provides an event-by-event lower bound and upper bound to the mass scale given the two mass differences 
i-C ' between the lightest three new particle states. This variable is most appropriate for short symmetric 

cascade decays involving two-body decays and on-shell intermediate states which end in standard-model 
f*) ■ particles and two dark-matter particles. An important feature of the constrained mass variables is that 

' they do not rely simply on the position of the end point but use the additional information contained 

, in events which lie far from the end point. To demonstrate our method we study the supersymmetric 

model SPS la. We select cuts to study events with two X2 each of which decays to x°> an d two opposite- 
sign same-flavor (OSSF) charged leptons through an intermediate on-shell slepton. We find that with 
300 fb _1 of integrated luminosity the invisible-particle mass can be measured to Mjj = 96.4 ± 2.4 GeV. 
qq ' Combining fits to the shape of the Mzc constrained mass variable distribution with the maxmu edge 

\ fixes the mass differences to ±0.2 GeV. 



1 Introduction 



qq ' If dark matter is produced at a hadron collider, the likely signature will be missing transverse momentum. 

In previous papers, we have introduced the constrained mass variable M 2C § g as a means to determine 
the mass of the dark matter. The main concept behind the constrained mass variable M 2 c is that after 
studying several kinematic quantities we may have well determined the mass difference between two particle 
states but not the mass itself. We then incorporate these additional constraints in the analysis of the events. 
We check each event to test the lower bounds and upper bounds on the mass scale that still satisfies the 
mass difference and the on-shell conditions for the assumed topology. Because the domain over which we 
are minimizing contains the true value for the mass, the end points of the lower-bounds' and upper-bounds' 
distributions give the true mass. 

In previous studies the constrained mass variable M 2 c was introduced and studied. The constrained 
mass variable M 2 c assumes a new state Y decays to visible particles (1) and (2) and N through a three-body 
decay in which the mass difference is given by the end point of the invariant-mass distribution of the two 
visible states on a branch 

maxmi 2 = (My - M N ) 2 . (1) 

In the situations we study, visible states (1) and (2) are leptons so we will refer to this edge as maxm;;. 
We also note that the M 2 c may also compliment or cross-check other mass-determination techniques ||, []|. 
|], [| |t], ||, H], |l(], O, [l2|, [l3| [l4|, [l5| which may somehow otherwise determine the mass difference but may 
have a remaining ambiguity on the mass scale. The purpose of this paper is extending the constrained mass 
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Figure 1: Events where the new state Y is pair produced and in which each Y decays through a two-body 
decay to a massive new state X and a visible state 1 and then where X subsequently decays to a massive state 
N invisible to the detector and visible state 2. All previous decay products are grouped into the upstream 
transverse momentum, k. 



variable to the case with three new on-shell states as depicted in Fig. [j]. With an on-shell intermediate state, 
the kinematic edge from the end point of the invariant-mass distribution of the visible states (1) and (2) on 
a branch gives the relationship 



max m j 2 = 



(M*-M*)(M% 



Ml 



(2) 



Each event now satisfies an additional set of on-shell constraints so the events should contain more infor- 
mation. Because Eq. (jl|) does not give the mass difference and because the M2C variable docs not use the 
additional information available from having three on-shell states in each event, then a better variable with 
which to find the mass scale likely exists by incorporating this missing information in the extremization. 

In this paper we introduce a constrained mass variable more appropriate for this case, one with an on- 
shell intermediate state, which we will call M^c- The variable M^c differs from M^c in that we assume an 
on-shell intermediate state X connects the two visible decay products so there are three new states and two 
relevant mass differences. We structure the paper around a case study of the supersymmetry benchmark 
point SPS la |h|. In this study, the three new states are identified as Y = x%i X — 1 an d N = Xv The 
visible particles leaving each branch are all opposite-sign same-flavor (OSSF) leptons (yu or e). This allows 
us to group hadronic activity into the vector k identified as upstream transverse momentum (UTM). 

The paper is structured as follows: Section || introduces the definition of M%c- At this stage we assume we 
know the two mass differences, an assumption which will be justified later in the paper. Section ^ discuses 
the dependence of M%c on complications from combinatorics, large UTM, missing transverse-momentum 
(fx) cuts, parton distributions, and energy resolution. Section |] applies M^c variables to HERWIG data from 
the benchmark supersymmetry spectrum SPS la. Section ^ shows how combining the edge from Eq. (^) 
with M3C one also finds the two mass differences My — Mjv and Mx — Mjv- Finally in Sec. ^ we summarize 
the papers 's contributions. 
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2 Introducing M^c 



We will now introduce the definition of M 3 q and its relationship to previous mass-shell techniques (MST). 
2.1 Definition of M 3C 

The upper-bound and lower-bound on the mass of the third lightest new particle state in the symmetric 
decay chain are the constrained mass variables M 3 c,lb and M^cub- This variable applies to the symmetric, 
on-shell intermediate state, topology from Fig. ^ which depicts two partons that collide and produce some 
observed UTM with four momenta k and an on-shell, pair-produced new state Y. On each branch, Y decays 
to on-shell intermediate particle state X and a visible particle v\ with masses Mx and m Vl . Then X decays 
to the dark-matter particle N and visible particle vi with masses Mn and m V2 . The four- momenta of V\ , 
V2 and N are respectively ai, «2 and p on one branch and /3i, 02 and q in the other branch. The missing 
transverse momenta f T is given by the transverse part of p + q. 

We initially assume that we have measured the mass differences from other techniques. For an on-shell 
intermediate state, there is no single end point that gives the mass difference. The short decay chain gives 
a kinematic end point maxm^ described in Eq. (|^) that constrains a combination of the squared mass 
differences. Unless two of the states are nearly degenerate, the line with constant mass differences lies very 
close to the surface given by Eq. (|^). The two mass differences are often tightly constrained in other methods. 
The mass differences are constrained to within 0.3 GeV from studying long cascade decay chains where one 
combines constraints from several end points of different invariant-mass combinations Q . In principle mass 
differences may also be found by using Mt2 with different assignments of upstream transverse momentum 
and missing transverse momentum as described in Ref. Jr7| . After initially assuming that we know the mass 
difference, we show that our technique can also find the mass differences. The M%c distribution shape is 
a function of both the mass scale and mass differences. We can constrain both the mass differences and 
the mass scale by fitting the maxmi2 edge constrains and the ideal M^c{^n, AMyn, AM^jv) distribution 
shapes to the observed Msc(AMyn, AMxn)- To find all three parameters from this fit, we will take Mn, 
AMyjv, and AMxn as independent variables. 

For this first phase of the analysis, let us assume the mass differences are known. For each event, the 
variable M^clb is the minimum value of the mass of Y (third lightest state) after minimizing over the 
unknown division of the missing transverse energy f T between the two dark-matter particles N: 



m\ G lb (AM Y n, AMxn) = min (p + on + a 2 ) 2 (3) 
Constrained to 

(p + q) T = fr (4) 

V(«i+«2+p) 2 -VPy = AMyn (5) 

V("2+p) 2 - VP) = AMxn (6) 

{ ai +a 2 +pf = (A + fa + qf (7) 

(a 2 + P ) 2 = {(32 + qf (8) 

P 2 = q 2 (9) 



where AMyn = My — Mn and AMxn = Mx — Mn- There are eight unknowns in the four momenta 
of p and q and seven equations of constraint. Likewise we define Msc.ub as the maximum value of My 
compatible with the same constraints. We discuss how to numerically implement this minimization and 
maximization in appendix |a]. Because the true p and q are within the domain over which we are minimizing 
(or maximizing), the minimum (maximum) is guaranteed to be less than (greater than) or equal to My- 
Just as with M 2 c, events with M 3 c (upper or lower) near the end-point are nearly reconstructed [] Wc 
provide a proof of the uniqueness of such a reconstruction in appendix [b]. 

1 In-principle we expect to find some events close to the end point. In practice it may be more challenging with M^q than 
with M20 The M^c variable is more sensitive to energy-resolution errors so the sharp cut off at the correct mass becomes a 
sloping gradual tail as seen in Fig ^ which obscures events are near the end-point. The M2C en d point is better for the purpose 
of reconstruction. Although fewer events will be near this end-point, the likelihood of energy resolution errors to completely 
mess up the results is less. This reconstruction near the endpoint is being exploited to study spin with the M^2 assisted on-shell 
(MAOS) reconstruction [H. 
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Figure 2: Ideal M^c,lb and M$c,ub distribution for 25000 events in two cases both sharing AMyjv = 100 
GeV and AMxn = 50 GeV. The solid, thick line shows My = 200 GeV, and the thin, dashed line shows 
M Y = 250 GeV. 

Figure || shows an ideal M^c,lb and M^c,ub distributions for 25000 events in two cases both sharing 
AMyn = 100 GeV and AMxn = 50 GeV. The dashed lines represent the distributions from events with 
My = 250 GeV, and the solid lines represent the distributions from events with My — 200 GeV. One can 
clearly see sharp end points in both the upper-bound and lower-bound distributions that give the value My. 
The upper-bound distribution is shown in lighter line (red online). 

We expect an event to better constrain the mass scale if we are given additional information about that 
event. In comparison to M^c where Y decays directly to N, here there is an on-shell intermediate state X. 
The extra state X and information about its mass difference AMx n enables Mac to make an event- by-event 
bound on My stronger than in the case of M^c- We will see that this stronger bound is partially offset by 
greater sensitivity to errors in momentum measurements. 

The variable M^c, like other variables we have discussed Macs Mti Mt and Mqt, is invariant under 
longitudinal boosts of its input parameters. We can understand this because all the constraint equations 
are invariant under longitudinal boosts. The unknown p and q arc minimized over all possible values fitting 
the constraints so changing frame of reference will not change the extrema of the Lorentz invariant quantity 
(p + a) 2 . 

2.2 Relation to other Mass-Shell Techniques 

There are several tools being studied to determine the mass of the dark-matter particle in a model-independent 
manner. We now discuss M^c i n the context of recent papers. 

The variable M^c is a constrained mass variable where we find minimum and maximum mass allowed 
by an event given the combination of on-shell requirements for an assumed topology, the observed missing 
transverse momentum, and mass differences which may extracted from kinematic end-points or other tech- 
niques. The variable M 2C |] @ was the first example of a constrained mass variable. This paper's M^c is 
the next example. 

The kinematic variable Mt2 f^0) (in addition to the original definition) can be seen as the lower-mass 
boundary of the region with these minimal kinematic constraints and an assumed mass for the dark-matter 
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particle Mt2 can be applied to subsystems of an assumed topology as outlined in jl7| and further 

studied more recently in p2[ |. The variable M^c does not use Mti as a calculation tool, but the variable 
M3C can also been seen as the boundary of the 'non-minimal]^] kinematic constraint mass region pi] ] for the 
topology with three on-shell new states shown in Fig [l]. 

Mass-shell techniques (MSTs) require the consistency of each event with missing transverse momentum 
and with on-shell requirements for an assumed topology. Mass-shell techniques |Q encompasses the polyno- 
mial methods described in Ref. and encompasses the minimal kinematic constraints method described 
in Ref pl| as well as many others reviewed in our earlier work|l[|2|. 

Since the publication of Ref. Jl[ , the combination of mass-shell constraints with kinematic end-point con- 
straints has stated being referred to hybrid mass-shell techniques fn], |2^| . The MST most closely associated 
with M 3 q is that of Cheng, Gunion, Han, Marandella, McElrath (CGHMM) which describe counting 
solutions at assumed values for the mass for Y , X, and N . By incorporating a minimization or maximiza- 
tion, we enhance CGHMM's approach because we have a variable whose value changes slightly with slight 
changes of the inputs instead of the binary on-off that CGHMM have with the existence of a solution^. As a 
hybrid technique, we also incorporate knowledge of the added information from other measurements which 
accurately determine the mass differences. Finally, the quantity M%c can form a distribution whose shape 
tells us information about the masses. Because for most events there is only one "turn-on" point below My, 
the distribution M 3 c,lb is very similar to the derivative of the Fig. 8 of CGHMM[|l(| to the left of their 
peak and M 3 c,ub is similar to the negative of the derivative to the right of their peak. They differ in that 
there may be multiple windows of solutions; also CGHMM's Fig. 8 is not exactly along the line of fixed mass 
differences; and the effect of backgrounds and energy resolution are dealt with differently. 

We also hope to show that the use of the distribution's shape enables us to exploit the essentially non- 
existent dependence of the distributions on the unknown collision energies and incorporate the dependency 
on UTM directly. This diminishes the dependence of the measurement on the unknown model while still 
allowing us to exploit the majority of the distribution shape in the mass determination. 

After studying previous MSTs, we were tempted to use Bayes' theorem with a parton distribution function 
as a likelihood function as was done in Goldstein and Dalitz [^3| and Kondo, Chikamatsu, Kim [^4| (GDKCK). 
They used the parton distribution function to weight the different mass estimates of the top-quark mass (My 
in our topology). We found that such a weighting leads to a prediction for My much smaller than the true 
value. This can be understood because the parton distributions make collisions with smaller center-of-mass 
energies (small x) more likely, therefore the posterior will prefer smaller values of My which are only possible 
for smaller values of x. Only if one includes the cross section for production, i.e. the likelihood of the event 
existing at all, in the Bayes likelihood function will we have the appropriate factor that suppresses small 
values of x and therefore small values of My. This balance therefore leads to the maximum likelihood (in 
the limit of infinite data) occurring at the correct My. Unfortunately, inclusion of the magnitude of the 
cross section introduces a large model dependence. In the case of the top-quarks mass determination, the 
GDKCK technique gives reasonable results. This is because they were not scanning the mass scale, but 
rather scanning \y (the top-quark mass) while assuming xn = Mn = and Xx = Mx = M\y- The 
likelihood of solutions as one scans xy rapidly goes to zero below the true top-quark mass M top . The parton 
distribution suppresses the likelihood above the true M top . The net result gives the maximum likelihood 
near the true top-quark mass but suffers from a systematic bias p5i ]2(| that must be removed by modeling 




3 Factors for Successful Shape Fitting 

One major advantage of using the M 3 c distribution (just as the M 2 c distribution |l|[||) is that the bulk 
of the relevant events are used to determine the mass and not just those near the end point. To make the 
approach mostly model independent, we study on what factors the distributions shape depends. We show 
that there is a strong dependence on UTM, but because the UTM distribution can be measured from the 
data this does not increase the model dependency. We show there is no numerically significant dependence 
on the collision energy which is distributed according to the parton distribution functions. This makes the 

2 The term 'non-minimal' is because we assume the more complex topology involving an on-shell intermediate state. 
3 We am grateful to Chris Lester for pointing out to me the importance of this feature. 
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Figure 3: (Left) The M^c distributions before (solid) and after (dashed) introducing the combinatoric 
ambiguity. (Right) The M^c distributions with and without UTM. The no UTM case (fcx = 0) is shown by 
the solid line; the large UTM case with kr = 250 GeV is shown by the dashed line. 



distribution shape independent of the production cross section and the details of what happens upstream 
from the part of the decay chain that we are studying. We model these effects with a simple Mathematica 
Monte Carlo event generator assuming My = 200 GeV, M x = 150 GeV, and M N = 100 GeV. 

• Effect of Combinatorics Ambiguities 

Just as in the topology in Refs studied earlier, where %2 decays via a three body decay, the branch 

assignments can be determined by either distinct OSSF pairs or by studying which OSSF pairs have both 
mi2 < maxmi2. In 90% of the events, there is only one combination that satisfy m\2 < maxmi2. This 
allows us to know the branch assignment of 95% of the four-lepton events without ambiguity. 

Unlike the three-body decay case, the order of the two leptons on each branch matters. The intermediate 
mass Mx = {q.% + p) 2 depends on a.2 and does not depend on a.\. To resolve this ambiguity we consider 
the four combinations that preserve the branch assignment but differ in their ordering. The M^,c,lb for the 
event is the minimum of these combinations. Likewise the M^c,ub is the maximum of these combinations. 
As one expects, Fig. || (Left) shows how the combinatorics ambiguity degrades the sharpness of the cut-off 
at the true mass. Not all applications share this ambiguity; for example in top-quark mass determination 
(pair produced with Y = top quark, X = W ± , N = v) the 6-quark-jet marks a.\ and the lepton marks a.2- 

• Effect Large Upstream Transverse Momentum 

In a similar behavior to M2C, the distributions of the variable M%c show a strong dependence on large 
UTM. In our case study this is identified as the combination of all the hadronic activity. Figure || (Right) 
shows the stronger upper-bound cut-off in the presence of large UTM. Unlike M2C, in M^c with hr — we 
still have events with nontrivial upper-bound values. 

We also tested the distribution for different values of k 2 . In Fig. || (Right) we fixed k 2 = (100 GeV) 2 . 
We also performed simulations with k 2 = (500 GeV) 2 and found the difference of the two M^c distributions 
consistent with zero after 15000 events. In other words, the distribution depends mostly on k x and k v and 
appears independent of fco- 

• The Effects of Detector Energy Resolution 

Compared to M2C1 t ne information about the extra states gives a stronger set of bounds. Unfortunately, 
the solution is also more sensitive to momentum measurement error. We model the finite energy resolution 
by scaling the four-vector with gaussian centered around 1 with the following widths which are similar to 
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Effects of Energy Resolution 




Figure 4: The effect of energy resolution on the M^c distribution. (Left) The dotted line shows the energy 
resolution has washed out the sharp cut-off. (Right) M^c,lb with perfect energy resolution plotted against 
the result with realistic energy resolutions. 



the expected resolutions of the ATLAS |28| and CMS|29| detectors. 
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The hadronic energy-resolution error, which is larger than the leptonic energy-resolution error, will increase 
the uncertainty in the missing transverse momentum. 

Figure^ shows the effect of realistic leptonic energy resolution for the case kx — on the M$c distribution. 
On the left we show the energy resolution (dashed line) compared to the perfect energy resolution (solid 
line). The energy resolution washes out the sharp cut-off. On the right we show M 3 c,lb with perfect energy 
resolution plotted against the result with realistic energy resolution. This shows that the cut-off is strongly 
washed out because the events with M^c closer to the true value of My (200 GcV in this case) are more 
sensitive to energy resolution than the events with M^c closer to AMyn- The peak in the upper-bound 
distribution at M^c,ub — 100 GeV comes from events that no longer have solutions after smearing the 
four-momenta. 

Because the energy resolution affects the distribution shape, its correct modeling is important. In the 
actual LHC events the fx energy resolution will depend on the hadronic activity in the events being consid- 
ered. Two events with the same kx = may have drastically different f T resolutions. Modeling the actual 
detector's energy resolution for the events used is important to predict the set of ideal distribution shapes 
which are compared against the low-statistics observed data. 

We do not consider lepton-isolation cuts. As described in Ref J2J, including such effects changes the 
distribution shape and decreases the statistics available with which to form the distributions. For example 
a lepton isolation cut of AR = \J A</> 2 + Arf > 0.1 excludes about 3% of the distribution events or using 
AR > 0.3 excludes about 20% of the distribution events. As lepton-isolation uncertainties and energy- 
resolution uncertainties are detector-specific, it would be interesting to study the distribution shape specific 
to different detectors. 

• Parton Distribution Function Dependence 

For a mostly model-independent mass-determination technique, we would like to have a distribution that 
is independent of the specific production mechanism of the assumed event topology. The parton distributions 
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Figure 5: The dependence of the M^c distributions on the parton collision energy. The solid line shows the 
collision distributed according to Eq. ([l3]) , and the dashed line shows the collision energy fixed at y/s = 600 
GeV. 



determine the center-of-mass energy yfs of the hard collisions, but the cross section depends on model- 
dependent couplings and parameters. The events we consider may come from production of different initial 
states (gluons or squarks) but end in the assumed decay topology. The M$c distribution, like the M-xc 
distribution, shows very little dependence on the underlying collision parameters or circumstances. 

Fig. [| (Left) shows the dependence of the M%c distributions on the parton collision energy. The solid 
line shows the M%c distributions of events with collision energy yfs distributed according to 



p(VJ) = 12 M| 



4M h 



(13) 



and the dashed line shows the M^c distributions of events with fixed y/s = 600 GeV. Figure || (Right) 
shows the difference of these two distributions with 2er error bars as calculated from 15000 events. The two 
distributions are equal to within this numerical precision. 

• Effects of f T Cuts 

As described in fl30| , |3l| , |], || , the standard-model four-lepton events with missing transverse momentum 
backgrounds are very strongly suppressed after a missing transverse momentum cut. This requires an analysis 
of what effect a fx > 20 GeV cut will have on the distribution shape. Figure || shows that the effect of this 
cut is significant dominantly at small M^c- On the left we see the M^c.lb result versus the f T . Unlike the 
M 2 c case in Rcf the M 3 c solutions in Fig. ^ do not correlate with the f T . The right panel of Fig. ^ 
shows the difference between the M^c,ub and M^c.lb distributions with and without the cut f T > 20 GeV. 
The smallest bins of M^c,lb are the only bins to be statistically significantly affected. The left-side suggests 
this lack of dependence on f T cuts is somewhat accidental and is due to the nearly uniform distribution of 
M3C solutions being removed by the cut. The stronger dependence of the smallest M$c bins on the fj- cut 
means we can either model the effect or exclude the first bins (about 10 GeV worth) from the distribution 
used to predict the mass. We will choose the latter because we will find that the background events also 
congregate in these first several bins. 

• Spin correlations 

In our simulation to produce the ideal curves, we assumed each decay was uncorrclated with its spin in 
the rest frame of the decaying particle. Spin correlations at production may affect this. However, such spin 
correlations are washed out when each branch of our assumed topology is at the end of longer decay chain. 
These upstream decays are the source of considerable UTM. 

Some spin-correlation information can be easily taken into account. The 777,12 (or 77734) distribution's shape 
is sensitive to the spin correlations along the decay chain p2][p3|. The observed mi2 (or 77734) distribution 
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Figure 6: The effect of missing transverse momentum cuts on the M$c distributions. (Left) The M^c.lb 
result versus the f T . (Right) The difference of the M 3 c,ub and M%c,lb distributions with and without the 
cut f T > 20 GeV. The smallest bins of M 3 c,lb are the only bins to be statistically significantly affected. 



can be used as an input to producing the ideal distribution shape. In this way spin correlations along the 
decay chain can be taken into account in the simulations of the ideal distributions. 

Spin correlations between the two branches can also affect the distribution shape. To demonstrate this 
we modeled a strongly spin-correlated direct production process. Figure [?] (Left) shows the spin-correlated 
process that we consider. Figure |?] (Right) shows the M^c upper-bound and lower-bound distributions 
from this process compared to the M%c distribution from the same topology and masses but without spin 
correlations. We compare distributions with perfect energy resolution, m vi = m„ 2 = GeV, My = 200 GeV, 
Mx = 150 GeV, and Mm = 100 GeV. Our maximally spin-correlated process involves pair production of Y 
through a pscudoscalar A. The fcrmion Y in both branches decays to a complex scalar X and visible fcrmion 
V\ through a purely chiral interaction. The scalar X then decays to the dark-matter particle N and another 
visible particle v%. The production of the pseudoscalar ensures that the Y and Y are in a configuration 

\/2 (| 11) + | It))- The particle Y then decays with X preferentially aligned with the spin. The Y decays 
with X* preferentially aligned against the spin. Because X is a scalar, the particle N decays uniformly in 
all directions from the rest frame of X. The correlated directions of X causes the two sources of missing 
transverse momentum to be preferentially parallel. The resulting greater magnitude of missing transverse 
momentum increases the cases where M^c has a solution closer to the end point. For this reason the spin 
correlated distribution (red dotted distribution) is above the uncorrelated distribution (black thick lower- 
bound distribution and blue thick upper-bound distribution). The upper-bound distribution is statistically 
identical after 25000 events. The lower-bound distribution clearly has been changed, but not by very much 
compared to the other factors on which the distribution shape depends that are described in this section. 
This is due to the subsequent decay of the X particle which lessens the likelihood that the two Ns will be 
parallel. For the remainder of the paper we assume no such spin correlations are present. 

• Backgrounds 

The standard model (SM) backgrounds for four-leptons and missing transverse momentum are studied 
in [^o[ In two previous publications we summarized the SM backgrounds and the dominance 

of supersymmetry (SUSY) backgrounds for this channel. As was mentioned earlier, the SM backgrounds 
for four leptons with missing transverse momentum are very strongly suppressed after a missing transverse 
momentum cut. 

To improve the quality of the fit, a model for backgrounds can be created based on assumptions about 
the origin of the events and wedge-box analysis like those described in Bisset eta/jTj and references therein. 
We performed such a model in the Ref. ||] and found the distribution shape isolated the correct mass of 
Xi to within 1 GeV with versus without the background model. In the studies of M%c and SPS la in this 
paper, the insensitivity to the background is again observed. If the background is present but not modeled, 
we observe a somewhat lower fit quality than in the absence of background, but the shift in Mj° is less than 
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Figure 7: Effect of a spin correlated process on the M 3 c distributions. Modeled masses are My = 200 GeV, 
Mx = 150 GeV, and Mjv = 100 GeV. The thick black and thick blue lines show the distributions of the 
uncorrelated lower-bound and upper-bound M 3 c- The dotted red lines show the distributions of the spin 
correlated process. 

1 GeV which is within the predicted uncertainty. In the SPS la example studied in the next section, SUSY 
background events form about 12% of the events. As such, we do not try to model the background in this 
M 3C study. 



4 Estimated Performance 

With an understanding of the factors affecting the shapes of the M 3 c distributions, we combine all the 
influences together and consider the mass-determination performance. We follow the same modeling and 
simulation procedures used in Ref Q except now we include an on-shell intermediate state and calculate 

3~6] to generate events according to the SPS la benchmark point [n6| 



M 3C . We use HERWIG 34 3 



an mSUGRA point with m = 100 GeV, 



l l/2 



250 GeV, A 



This is 

TOO GeV, tan/3 = 10, and sign(/i) = +. 
We initially assume the modeled mass differences of AMyn = 80.8 GeV and AMxn = 47.0 GeV have been 
previously measured and take them as exact. We later show how the M 3 c distribution-shape changes with 
the mu end point also can be used to solve for the two mass differences. 

The HERWIG simulations select the charged leptons (e^ and /i^) produced in the decay of heavy objects 
(SUSY particles and W and Z bosons) for further study provided they satisfy basic selection criteria on 
transverse momentum (j>t > 10 GeV) and pseudorapdity (|?7| < 2.5). Leptons coming from hadron decays 
are usually contained within hadronic jets and so can be experimentally rejected with high efficiency using 
energy or track isolation criteria. This latter category of leptons was therefore not used in this study. The 
acceptance criterion used for the hadronic final state was \rj\ < 5. The detector energy resolution functions 
used are described in Sec. |[ 

Like M2C, the M 3 c distributions can be well-predicted from observations. When we determine the masses 
based on distribution shapes, the larger the area difference between two distributions representing different 
masses, the more accurately and precisely we will be able to tell the difference. Unfortunately, the M 3 c 
distribution is sensitive to finite momentum-resolution errors and combinatoric errors which have the effect 
of decreasing the large area difference between the distributions of two different masses shown in Fig. g. 

Just as in Ref Q , we model the distribution shape with a simple Mathematica Monte Carlo, and compare 
the predicted distribution shapes against the HERWIG data modeling the benchmark point SPS la. We again 
use the observed UTM as an input to the Mathematica simulated ideal distributions. By modeling with 
Mathematica, which does not use SUSY cross sections, and comparing to more realistic HERWIG-generated 
data, we hope to test that we understand the major dependencies of the shape of the M 3 c distributions. 
The Mathematica event generator produces events based on assumptions of a uniform angular distribution 
of the parents in the center-of-mass frame, the parent particles decay with a uniform angular distribution in 
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SPS la with 100 fb" 1 of Events 

SPS 1 a HERWIG Simulation And Ideal Curve 1 207 signal and 1 66 background 

All Effects Included, 1372 Events M x «= 98.6+/-2.2 GeV 




«3 CLE M 3 c t UB (OcV) M(f), <GeV) 



Figure 8: Fit of ideal M^ciM^o) distributions to the HERWIG generated M^c distributions. Includes com- 
binatoric errors, backgrounds, energy resolution, and fx cuts. (Left) The observed HERWIG counts ver- 
sus the expected counts for ideal — 95 GeV. (Right) The \ 2 fit to ideal distributions of M^> = 
80, 85, 90, 95, 100, 105, 110 GeV. The correct mass in HERWIG is M % o = 96.0 GeV. 



the rest frame of the parent. The particles are all taken to be on shell. The effect of kx > is simulated by 
boosting the event in the transverse plane to compensate a specified kx- 

The results are shown in Figure ||. The left side of Fig. || shows the M%c lower-bound and upper-bound 
counts per 5 GeV bin from the HERWIG generated data, and it shows the predicted ideal counts calculated 
with Mathcmatica using the observed UTM distribution and assuming = 95 GeV. The upper-bound 
and lower-bound distribution show very close agreement. The background events are shown in dotted lines 
and are seen accumulating in the first few bins. These are the same bins dominantly affected by f T cuts. For 
this reason we excluded these first two bins from the distribution fit. The right side of Fig. || shows the \ 2 fit 
of the HERWIG simulated data M^c distribution to the ideal M?,c{M^°) distribution with taken as the 
independent variable. Ideal distribution shapes are calculated at values of = 80, 85, 90, 95, 100, 105, 110 
GeV. The x 2 fitting procedure is described in more detail in appendix of Ref. H except with M 2 c replaced 
with M3C and with the background fraction A of the ideal distribution being tested against fixed at 0. All 
effects discussed in this paper are included: combinatoric errors, SUSY backgrounds, energy resolution, and 
fx cuts. Our ideal curves were based on the Mathcmatica simulations with 25000 events per ideal curve. 
Despite the presence of backgrounds, the \ 2 P er degree of freedom (number of bins) is not much above 1 per 
bin. 

The particular fit shown in Fig. ^ gives M%° = 98.6 ± 2.2 GeV where we measure uncertainty by using 
the positions at which % 2 is increased by one. We repeat the fitting procedure on nine independent data 
sets each with ps 100 fb -1 of HERWIG data (w 1400 events for each set). The mean and standard deviation 
of these nine fits give = 96.8 ± 3.7 GeV. After 300 fb -1 one should expect a y/3 improvement in the 
uncertainty giving ±2.2 GeV. The correct mass in HERWIG is = 96.05 GeV. 

5 Finding the mass differences using mu edge and M%c 

Our technique also enables a combined fit to both the mass differences and the mass scale. The m« end 
point in Eq. (J2J) constrains a relationship between the three masses. Gjelsten, Miller, and Osland estimate 
that this edge can be measured to better than 0.08 GeV || ^| using many different channels that lead to the 
same edge, and after modeling energy resolution and background. In the next several paragraphs we show 
that by combining this edge with the fits to the M%c upper-bound and lower-bound distribution shapes, we 
can constrain all three masses. 

We first numerically calculate the effect of errors in the mass differences. We use simulated data corre- 
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<5AM YN (GeV) 

Figure 9: Combined constraint from fitting both maxmn and Msc with the mass difference as free param- 
eters. We parameterized the difference from the true values in the model by AMyn = 80.8 GeV + SAMyn 
and AM X N = 47.0 GeV + 5AM X n- We shown the 1, 2, 3cr contours. 

sponding to 300 ftV 1 (about 3600 signal and 450 background events) including all the effects discussed. We 
parameterize the error from the correct mass difference in the model by the variables SAMyn and SAMxn 
so that mass differences are given by AMyn = 80.8 GeV + 5AM Y n and AM X n = 47.0 GeV + SAMxn- 
We calculate the Xm 3C at 8 points surrounding the correct mass difference by amounts SAMyn = ±1 GeV 
and SAMxn = ±1 GeV. The minimum Xm 3C at each of the 9 points gives the value of M^° for each mass 
difference assumed. The position of the minima can be parameterized by a quadratic near the true mass 
difference. The resulting fit 

Mgo = 96.4 + 1.9 (SAMxn) 2 + 2.5 SAM YN SAMxn + 3-25 AM XN - 3.8 {SAM YN ) 2 - 8.3SAM YN (14) 

shows in units of GeV how the mass M^>° is affected by small errors in the mass difference. 

The Xm 3C (f° r ^ bins and without a backgrounds model) at these 9 different values for the mass difference 
provides another constraint on the mass differences. Fitting the minxM 3C to a general quadratic near the 
true mass difference gives 

minxlf 3C = 162 + 38 (SAM X n) 2 -8SAM yn SAM X n -B5AM X n ~ 25SAM YN - (15) 

The minx|/ 3C . described by Eq. (|l5|) shows a sloping valley. The sides of the valley constrain SAMxn as 
seen by the large positive coefficient of (SAMxn) 2 - The valley slopes downward along SAMyn as can be 
seen by the large negative coefficient of SAMyn which leaves this axis unbounded within the region studied. 

The unconstrained direction along AMyn can by constrained by the mass relationships given by the end 
point maxmjj or by Mti as described in Ref. |l7]|. Here we work with maxm;; to provide this constraint. Wc 
calculate the Xmaxmn using S(m&xmu) = 0.08 GeV, and Eq. (0) with My = AMyn + SAMyn + M%o and 
M x = AMxn + SAMxn + M^o where we use M^° from Eq. fll4p. This Xmaxm;, constrains a diagonal path in 
(SAMyn, SAMxn)- The value of the Xmaxmu a * l ne minimum is a constant along this path. The combined 
constraint Xm 3C ^° Xmaxm,, leads to the a minimum at SAMyn = 0.18 GeV and SAMxn = 0.25 GeV where 
M^° = 95.7 GeV as shown in Fig. |9[ We have shown the contours where x 2 increases from its minimum 
by 1,2 and 3. The uncertainty in the mass differences around this minimum is about ±0.2 GeV. The small 
bias from the true mass differences is due to the unconstrained Xm 3C along SAMyn and disappears with 
increasing statistics. We can also use modeling to deduce the unbiased mass differences. 
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Putting together all effects and propagating the effects of uncertainty in the mass differences, we estimate 
a final performance of = 96.4 ± 2.4 GeV after 300 fb _1 with about 3600 signal events amid 450 

background events. We find the mass differences (without bias correction) of — = 81.0 ± 0.2 
GeV and M~ 1r - M$o = 44.3 ± 0.2 GeV. This is to be compared to the HERWIG values of = 96.0 GeV, 
Myo - Myo = 80.8 GeV, and Mr - M*° = 44.3 GeV. 

6 Discussion and Conclusions 

How does this performance compare to other techniques? Because SPS la is commonly used as a test case, we 
can approximately compare performance with two different groups. The technique of |], |^, which uses 
edges from cascade decays, determines the LSP mass to ±3.4 GeV with about five hundred thousand events 
from 300 fb _1 . The approach of CEGHM JlJ] assumes a pair of symmetric decay chains and assumes two 
events have the same topology and intermediate states. They reach ±2.8 GeV using 700 signal events after 
300 fb~ , but have a 2.5 GeV systematic bias that needs modeling to remove. Both techniques also constrain 
the mass differences. By comparison we find ±3.7 GeV after 100 fb _1 (1200 signal, 150 background) and 
estimate ±2.4 GeV after 300 fb _1 (3600 signal, 450 background) and propagating reasonable uncertainties 
in the mass differences. The uncertainty calculations differ amongst research groups. Some groups estimate 
uncertainty from repeated trials, and others use the amount one can change the mass before \ 2 increases by 
one. Without careful comparison under like circumstances by the same research group, the optimal method 
is not clear. What is clear is that fitting the M%c,lb and M^cub distributions determines the mass of 
invisible particles as well if not better than the other known methods in both accuracy and precision. 

In this paper, we have extended the constrained mass variable to the case with three new on-shcll particle 
states. We assume events with a symmetric, on-shell intermediate-state topology shown in Fig. Ej, We can 
either assume that we have measured the mass difference between these new states through other techniques, 
or combine our technique with the maxm^ edge to find both mass differences and the mass scale. The new 
constrained mass variables associated with events with these three new particle states are called M^c.lb 
and M^c.ub, and they represent an event-by-event lower bound and upper bound (respectively) on the 
mass of the third lightest state possible while maintaining the constraints described in Eqs. (Q)-(||). We 
have shown that most of the M^c distribution properties described in the Refs Jl|][Q carry through to M$c- 
The additional particle state and mass difference enable a tighter event-by-event bound on the true mass. 
The M3C distribution is more sensitive than the Mic distribution to the momentum and energy-resolution 
errors. Studying the performance on the SPS la benchmark point, we find that despite the energy-resolution 
degradation, we are able to determine to at least the same level of precision and accuracy as that found 
by using cascade decays or by using other MSTs. 
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A How to calculate M%c 



To find the M 3 c, we observe that if we assume masses of Y, X, and N to be ^ (xy,Xx,Xn) with the given 
mass differences then there are eight constraints 



(P + 0)t 

( ai +a 2 +p) 2 = ((3i+f3 2 +q) 2 
{a 2 +p) 2 = {(3 2 +q) 2 

2 2 

v = q 



Xy 

x\ 

Xn 
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AM YN + AM XN ) 2 
AM YN ) 2 



(16) 
(17) 
(18) 
(19) 



and eight unknowns, p^ and q^. The spatial momenta p and q can be found as linear functions of the th 
component of p and q by solving the matrix equation 
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where a = ct\ + a 2 and /? = f3\ 



/?2- We substitute p and q into the on-shell constraints 

(p(Po,q )) 2 



% - (?(Po,?o)) 



(20) 



(21) 
(22) 



giving two quadratic equations for p a and g Q . These give four complex solutions for the pair p Q and q . We 
test each event for compatibility with a hypothetical triplet of masses (xy, Xx, Xn) = (xy, Xy — AMyjv + 
AMxn,Xy — AMyx). If there are any purely real physical solutions where p > and q > 0, then we 
consider the mass triplet (xy , Xx, Xn) viable. 

As we scan xy while keeping the mass differences fixed, a solution begins to exist at a value less than or 
equal to My and then sometimes ceases to be a solution above My- Sometimes there are multiple islands 
of solutions. To find the M 3 c, we can test each bin starting at xy = AMy^ along the path parameterized 
by Xy and the mass differences to find the first bin where at least one physical solution exists. This is the 
lower-bound value of M 3 c for the event. 

Likewise for an upper bound. We begin testing at the largest conceivable mass scale we expect for the 
Y particle state. If a solution exists, we declare this a trivial Mzc,UB- If no solution exists, then we search 
downward in mass scale until a solution exists. 

A faster algorithm involves a bisection search for a solution within the window that starts at AMyn and 
ends at our highest conceivable mass. We then use a binary search algorithm to find at what Xy the solution 
first appears for M 3 c,lb or at what xy the solution disappears giving M 3 c,ub- There are rare events where 
there are multiple islands of solutions. This occurs in about 0.01% of the events with UTM and in about 
0.1% for kr — 250 GcV. In our algorithm we neglect windows of solutions more narrow than 15 GeV. We 
report the smallest edge of the lower-mass island as the lower and the upper edge of the larger-mass island as 
the upper bound. Because of the presence of islands, we are not guaranteed that for an event solutions exist 
everywhere between M 3 c,lb and M^c.ub- With the inclusion of energy-resolution errors and background 
events, we also find cases where there are no solutions anywhere along the path being parameterized. If 
there is no solutions anywhere in the domain we make M 3 c,LB to be the largest conceivable mass scale, and 
we set M 3 c ub = AAIyn- 



B Uniqueness of Event Reconstruction 

InRefs. @|] and in Sec. | we claim that the events near an end point of M 2 c and M 3 c distributions (events 
that nearly saturate the bound) are nearly reconstructed. This appendix offers a proof of the claim. To prove 
uniqueness, we need to establish that as M 3 c or M 2 c of an event (lower bound or upper bound) approach 
the end point of the distributions, the solutions with different values of q and p approach a common solution. 

4 We use x to distinguish hypothetical masses (xy, Xx, Xn) from the true masses (My, Mx, Mn)- 
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Figure 10: Shows the ellipses defined for p and q in Eqs. (|2lj)-(22) using the correct mass scale for an 
event that nearly saturates the M%c end point. For this event, the M%c hes within 1% of the end point and 
reconstructs p and q to within 4%. Perfect error resolution and combinatorics are assumed. 



We begin with M^c- Section ^ shows that there are at most four solutions given Mn, Mx and My 
formed by the intersection of two ellipses in (p , q ) defined by Eqs. (21)-(^2|) as shown in Fig. [lo| Consider 
the case that an event has a lower bound M^c near My. We are guaranteed that a solution occurs at the 
true mass scale when we choose the correct combinatoric assignments. The ellipses either have a discrete 
number of solutions or they describe the same ellipse (an unlikely case which we dismiss as not relevant to 
realistic events). As one varies the mass scale downward, the two ellipses drift and change shape and size 
so that four solutions become two solutions and eventually, at the value of M 3 q for the event, become one 
single solution. When the disconnection of the two ellipses occurs near the true mass scale, the value of M 3 c 
will be near the end point. The unique solutions for p and q given at M%c arc nearly degenerate with the 
true values of p and q found when one uses the true masses to solve for p and q. The closer M$c is to the 
endpoint the closer the two ellipses are to intersecting at a single point when the true masses are used and 
to giving a unique reconstruction. The example pictured in Fig. ^ show an event with M^c within 1% of 
the end point and where the p and q are reconstructed to within 4%. This shows that for M%c events that 
are near the end point allowing for any choice of combinatorics then nearly reconstruct the true values for p 
and q. If there are combinatoric ambiguities, one need to test all combinatoric possibilities. If the minimum 
combinatoric option has a lower bound at the end point, the above arguments follow unchanged. The above 
arguments can be repeated to show M 3 c,ub near the end point also reconstructs the correct p and q. 

Next we turn to M 2 q- For every event the lower-bounds satisfy M 2 c(My — Mn) < M 3 c(My — Mjy, Mx — 
Mjy). With Mic the propagator (p + a 2 ) 2 , which we can equate with x\i ^ s n °t fixed. The kinematically 
allowed values for \x are M N < x\ < M Y assuming the visible states a% and a 2 are massless. Eq. ( p0| ) 
shows that p and q solutions are linear in Xx with no terms dependent on \x alone or other powers of 
Xx- Including Xx as a f ree parameter in Eqs. (pl|)-(p2"|) leads to two ellipsoids (or hyperboloids) in the 
space (p , q , Xx)- We will assume without loss of generality that these are ellipsoids because the arguments 
follow unchanged if they are hyperboloids. Again, at the true mass scale we are guaranteed the two ellipsoids 
intersect at an ellipse. We again neglect the physically unlikely case the two ellipsoids are degenerate. Now as 
one varies the mass scale the two ellipsoids drift and change shape and size. The M 2 c value then corresponds 
to the mass scale where the two ellipsoids are in contact at one point. As we select events with a value of 
M 2 c that approaches the true mass scale the intersection of the two ellipsoids shrink to a point giving a 
unique reconstruction of p and q. The combinatoric ambiguities for M 2 q are avoided by selecting events 
with two distinct OSSF pairs. Events that saturate the upper bound of M 2 c also reconstruct p and q by the 
same logic as above. 
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